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1 A software instruction counter 
J. M. Mellor-Crummey, T. J. LeBlanc 

April 1989 ACM SIGARCH Computer Architecture News , Proceedings of the third 
international conference on Architectural support for programming 
languages and operating systems, volume 17 issue 2 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: ^ pdf(997.70 KB) 



Although several recent papers have proposed architectural support for program debugging 
and profiling, most processors do not yet provide even basic facilities, such as an instruction 
counter. As a result, system developers have been forced to invent software solutions. This 
paper describes our implementation of a software instruction counter for program 
debugging. We show that an instruction counter can be reasonably implemented in 
software, often with less than 10% execution overhead. Ou ... 



Hardware-assisted replay of multiprocessor programs 
David F. Bacon, Seth Copen Goldstein 

December 1991 ACM SIGPLAN Notices , Proceedings of the 1991 ACM/ONR workshop 

on Parallel and distributed debugging, volume 26 issue 12 
Full text available: f&l pdf(1.20 MB) Additional Information: full citation , references , ci tings , index terms 



Replay for concurrent non-deterministic shared-memory applications 
Mark Russinovich, Bryce Cogswell 

May 1996 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1996 conference 

on Programming language design and implementation, volume 31 issue 5 
Full text available* pdf(968 81 KB) Additional Information: full citation , abstract , references , citings , index 
^ : terms 

Replay of shared-memory program execution is desirable in many domains including cyclic 
debugging, fault tolerance and performance monitoring. Past approaches to repeatable 
execution have focused on the problem of re-executing the shared-memory access patterns 
in parallel programs. With the proliferation of operating system supported threads and 
shared memory for uniprocessor programs, there is a clear need for efficient replay of 
concurrent applications. The solutions for parallel systems can b ... 

Keywords: instruction counter, non-determinism, repeatable execution, shared memory 
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4 The execute operations — a fourth mode of instruction sequencing 
F. P. Brooks 

March 1960 Communications of the ACM, volume 3 issue 3 

Full text available: fHHI pdf(415.00 KB) Additional Information: full citation , references 



A survey of rollback-recovery protocols in message-passing systems 
E. N. (Mootaz) Elnozahy, Lorenzo Alvisi, Yi-Min Wang, David B. Johnson 
September 2002 ACM Computing Surveys (CSUR), volume 34 issue 3 

_ * i . . *i ui « -uiCAneaisr>\ Additional Information: full citation , abstract , references , citings , index 

Full text available: jz\ pdf(549.68 KB) 

^ terms , review 

This survey covers rollback-recovery techniques that do not require special language 
constructs. In the first part of the survey we classify rollback-recovery protocols into 
checkpoint-based and log-based. Checkpoint-based protocols rely solely on checkpointing 
for system state restoration. Checkpointing can be coordinated, uncoordinated, or 
communication-induced. Log-based protocols combine checkpointing with logging of 
nondeterministic events, encoded in tuples call ... 

Keywords: message logging, rollback-recovery 



6 A microprogrammed implementation of EULER on IBM system/360 model 30 
Helmut Weber 

September 1967 Communications of the ACM, volume 10 issue 9 

Full text available: fl!1 pdf(1.34 MB) Additional Information: full citation , references , citings , index terms 



Parallel FRAMs with owned global memory and deterministic context-free language 
recognition 

Patrick W. Dymond, Walter L. Ruzzo 

January 2000 Journal of the ACM (J ACM), volume 47 issue i 

Additional Information: full citation , abstract , references , index terms . 



Full text available: *p| pdf(223.64 KB) 

^ review 

We identify and study a natural and frequently occurring subclass of Concurrent Read, 
Exclusive Write Parallel Random Access Machines (CREW-PRAMs). Called Concurrent Read, 
Owner Write, or CROW-PRAMS, these are machines in which each global memory location is 
assigned a unique "owner" processor, which is the only processor allowed to write into it. 
Considering the difficulties that would be involved in physically realizinga full CREW-PRAM 
model and demonstrate i ... 

Keywords: CROW-PRAM, DCFL recognition, owner write, parallel algorithms 



Design problems in emulating the MIX computer on the Microdata 1600 
T. Don Dennis, O. G. Johnson 

September 1976 Proceedings of the 9th annual workshop on Microprogramming 

Full text available: *^ pdf(388.01 KB) Additional Information: full citation , abstract , references , index terms 

This paper presents an overview of an emulator for the MIX computer written in Microdata 
1600 microcode. The MIX computer thus emulated is a variant of the original MIX computer 
as described in Volume 1 of The Art of Computer Programming by Donald Knuth. Basic 
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changes involve the utilization of 8 bit bytes along with the ASCII character code. 



9 System Specifications for the DYSEAC Q 
Alan L. Leiner 

April 1954 Journal of the ACM (JACM), Volume i issue 2 

Full text available: ^ pdf(1.40 MB) Additional Information: full citation , ci tings , index terms 



10 SPAM: a microcode based tool for tracing operating system events I 
Stephen W. Melvin, Yale N. Patt 

December 1987 Proceedings of the 20th annual workshop on Microprogramming 

r- ... ^ , u, ffi MAnc k Additional Information: full citation , abstract , references , citings , index 

Full text available: l Pi pdf(405.55 KB) ; 

terms 

We have developed a tool called SPAM (for System Performance Analysis using Microcode), 
based on microcode modifications to a VAX 8600, that traces operating system events as a 
side-effect to normal execution. This trace of interrupts, exceptions, system calls and 
context switches can then be processed to analyze operating system behavior for the 
purpose of debugging, tuning or development. SPAM allows measurements to be made on a 
fully operating UNIX system with little perturbation (typica ... 

11 Efficiently counting program events with support for on-line queries I 
Thomas Ball 

September 1994 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 16 Issue 5 

r- .. * ^ .. fit , /7flil 7e Additional Information: full citation , abstract , references , citings , index 

Full text available: Tf*] pdf(784.76 KB) ; 

terms 1 

The ability to count events in a program's execution is required by many program analysis 
applications. We represent an instrumentation method for efficiently counting events in a 
program's execution, with support for on-line queries of the event count. Event counting 
differs from basic block profiling in that an aggregate count of events is kept rather than a 
set of counters. Due to this difference, solutions to basic block profiling are not well suited to 
event counting. Our algorithm fin ... 

Keywords: control-flow graph, counting, instrumentation 



12 Evaluation and performance of computers: the program monitor — a device for program Q 
performance measurement 

c. T. Apple 

August 1965 Proceedings of the 1965 20th national conference 

Full text available: ^ pdff 856.22 KB) Additional Information: full citation , abstract , citings , index terms 

IN 1961 A GROUP was established within IBM to test systems programs before they were 
released for customer usage. The goal of this group was to assure IBM management that 
each program released would be satisfactorily usable by the customer. One step taken by 
this group was to develop a monitor device which would permit programmers to record 
information being handled by the CPU during execution. Their intent was to use this 
recorded information to analyze the basic nature of progr ... 

13 Microprogramming revisited Q 
Michael J. Flynn, M. Donald McLaren 

January 1967 Proceedings of the 1967 22nd national conference 

Additional Information: 
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full citation , abstract , references , citings , index 
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It is the objective of this paper to briefly trace the history of the idea and the difficulties 
involved with defining or implementing it. In doing this, we first consider the general control 
problem and instruction formats. Next, storage implementations of the control function are 
considered and a restricted definition of microprogramming is proposed. This is then 
evaluated from a technological, architectural and programming point of view. We hope to 
show that our (demanding) definition of ... 

14 Introducing computer concepts by simulating a simple computer Q 
Robert A. Campbell 

September 1996 ACM SIGCSE Bulletin, volume 28 issue 3 

Full text available: ^pdf(217.65 KB) Additional Information: full citation , abstract , citings , index terms 

The simulated computer consists of (1) main memory, (2) a register known as the 
accumulator, (3) a central processing unit (CPU), and (4) an instruction counter. This 
computer recognizes 8 op codes (Halt, Load, Store, Add, Subtract, Read, Write, and Branch 
On Zero). The computer is simulated by creating a program in Pascal or C++. This program 
simulates the execution of programs written by students, such as adding two numbers and 
printing their sum. Student programs are written in machine langua ... 

15 Optimal tracing and incremental reexecution for debugging long-running programs Q 
Robert H. B. Netzer, Mark H. Weaver 

June 1994 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1994 conference 

on Programming language design and implementation, volume 29 issue 6 
Full text available: f^pdf(1.34 MB) Additional Information: full citation , references , citings , index terms 



16 Race Frontier: reproducing data races in parallel-program debugg ing Q 
Jong-Deok Choi, Sang Lyul Min 

April 1991 ACM SIGPLAN Notices , Proceedings of the third ACM SIGPLAN symposium 

on Principles and practice of parallel programming, volume 26 issue 7 
Full text available: ^ pdf(1.05 MB) Additional Information: full citation , references , ci tings , index terms 



17 Special session on memory wall: Fighting the memory wall with assisted execution Q 
Michel Dubois 

April 2004 Proceedings of the first conference on computing frontiers on Computing 
frontiers 

Full text available: ^pdf(231.18 KB) Additional Information: full citation , abstract , references , index terms 

Assisted execution is a form of simultaneous multithreading in which a set of auxiliary 
"assistant" threads, called nanothreads, is attached to each thread of an application. 
Nanothreads are lightweight threads which run on the same processor as the main 
(application) thread and help execute the main thread as fast as possible. Nanothreads 
exploit resources that are idled in the processor because of hazards due to program 
dependencieis and memory access delays.Assisted execution has the po ... 

Keywords: cache memories, latency tolerance, prefetching, simultaneous multithreading, 
superscalar processors 



18 Debugging standard ML without reverse engineering Q 
Andrew P. Tolmach, Andrew W. Appel 



http://portal.acm.org/results.cfm?coll=ACM&dl=ACM&CFID=32342^ 11/24/04 



May 1990 Proceedings of the 1990 ACM conference on LISP and functional 
programming 

r- ^ -I u. « ^, on Additional Information: full citation , abstract, references , citings, index 

Full text available: T ig pdff 1.29 MB) 

fc^-" terms 

We have built a novel and efficient replay debugger for our Standard ML compiler. 
Debugging facilities are provided by instrumenting the user's source code; this approach, 
made feasible by ML's safety property, is machine-independent and back-end independent. 
Replay is practical because ML is normally used functionally, and our compiler uses 
continuation-passing style; thus most of the program's state can be checkpointed quickly 
and compactly using call-with-current-continuation. Together, ... 

19 Record/replay for nondeterministic program executions 

Michiel Ronsse, Koen De Bosschere, Mark Christiaens, Jacques Chassin de Kergommeaux, 
Dieter Kranzlmuller 

September 2003 Communications of the ACM, volume 46 issue 9 

r- ., x ^ ., u fiii ^/-^ n« i/Di Additional Information: full citation , abstract , references , citings , index 

Full text available: *p] pdf(106.09 KB) 

tex*^-* terms 

Controlling the nondeterministic features within multithreaded and highly responsive 
applications enables the continued use of all traditional software development techniques. 

20 Formal papers: A microprogrammed implementation of a block structured architecture 
Michael J. Lutz, Michael J. Manthey 

September 1972 Conference record of the 5th annual workshop on Microprogramming 

Full text available: *|£) pdf(919.89 KB) Additional Information: full citation , abstract , references , ci tings 

This paper reports our current progress at SUNY/Buffalo in building an emulator for a block 
structured architecture. We briefly describe the overall research effort and the architecture 
of the target machine. We next discuss the mapping of the emulator program onto the 
available resources and the logical structure of the emulator. Finally we describe the current 
and projected micro-level instrumentation of the emulator which forms the basis for future 
experimentation with the architecture. 
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